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Abstract. Degree-based graph construction is an ubiquitous problem in network 
modeling [1, 2], ranging from social sciences to chemical compounds and biochemical 
' reaction networks in the cell. This problem includes existence, enumeration, exhaustive 

. construction and sampling questions with aspects that are still open today. Here we 

give necessary and sufficient conditions for a sequence of nonnegative integers to be 
realized as a simple graph's degree sequence, such that a given (but otherwise arbitrary) 
. set of connections from a arbitrarily given node are avoided. We then use this result 

^ ' to present a swap-free algorithm that builds all simple graphs realizing a given degree 

■ sequence. In a wider context, we show that our result provides a greedy construction 

\ method to build all the /-factor subgraphs (Tutte, [3]) embedded within Kn\Sk, where 

Tj" ' Kn is the complete graph and Sk is a star graph centered on one of the nodes. 
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1. Introduction and Sumniciry 

In network modeling of complex systems [1, 2], one usually defines a graph with 
components of the system being represented by the nodes, and the interactions amongst 
the components being represented as the links (edges) of this graph. This graph is 
usually inferred from empirical observations of the system and it is uniquely determined 
if one can specify all the connections in the graph. Occasionally, however, the data 
available from the system is incomplete, and one cannot uniquely determine this graph. 
In this case there will be a set Q of graphs satisfying the data, and one is faced with 
the following problems: 1) construct a graph from Q\ 2) count the number of elements 
(graphs) in ^; 3) construct all graphs from Q and 4) construct a typical element of 
often interpreted as a uniform random sample taken from Q. Problems 1),3) and 4) 
are construction type problems, whereas 2) is an enumeration type problem. In this 
paper we restrict ourselves to simple, undirected graphs, that is, any edge connects a 
single pair of distinct nodes (no hypergraph, no self-loops) and there is at most one edge 
incident on any pair of nodes (no parallel or multiple edges). 

A rather important and typical situation is when the empirical data specifies only 
the degrees of the nodes, in form of a sequence d = {di, c?2, • • • , dn} of positive integers, 
di > 1, 1 < i < n. (Wc exclude zero degree nodes, since they represent isolated points.) 
In the following we will refer to such CclSGS clS "degree based" construction (enumeration) 
problems. There are numerous examples, we will mention only a few here. In epidemics 
studies of sexually transmitted diseases [4] the data collected is from anonymous surveys, 
where the individuals specify the number of different partners they have had in a given 
period of time, without revealing their identity. In this case, the epidemiologist is 
faced with constructing the most typical contact graph obeying the empirical degree 
sequence. Another example comes from chemistry where the task is to determine the 
total number of structural isomers of chemical compounds, such as alkanes. In this 
case nodes represent chemical elements (atoms) in the compound and a link represents 
a chemical bond. In the case of alkanes the bond can be interpreted as a single link 
in the corresponding graph (no double bonds). Since the valence of an atom is fixed, 
the formula of an alkane such as C4H1Q (butane) will specify only the degree sequence. 
Knowing all the possible graphs with this degree sequence provides a starting point from 
which the feasible structures can be inferred. In particular, butane has 2, octane (Cg-ffig) 
has 18, C2QH42 has 366,319 isomers, etc. Degree-based graph construction is also found 
in many other network modeling problems, such as communications (Internet, www, 
peer-to-peer networks,) biology (metabolic networks, gene transcription, etc.) ecology 
(food webs) and social networks. 

It is easy to see that not all integer sequences can be realized as the degrees of 
a simple graph (the existence problem). For example, while {2,1,1} and {2,2,2} are 
the degree sequences of a path (P3, •-•-•) and a triangle, there is no simple graph 
with degree sequence {3, 2, 1} or {1, 1, 1} or {4, 4, 2, 1, 1}. Let G{V, E) denote a simple 
graph where V — {v-i,V2, ■ ■ ■ ,Vn} denotes the set of nodes and E the set of edges. 
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Consider a sequence of positive integers d — {di,d2, ■ ■ ■ ,dn} arranged in decreasing 
order, di > d2 > ■ ■ ■ > dn (for convenience reasons, only). If there is a simple graph 
G{V, E) with degree sequence d, then we call the sequence d a graphical sequence 
and in this case we also say that G realizes d. A second observation is that given a 
graphical d (and thus, we know that a simple graph G exists with this degree sequence), 
careless connections of pairs of nodes may not result in a simple graph. For example, 
consider the sequence {2,2,2,2} which is graphical (4-cycle). Making the connections 
{{vi-,V2).,{vi^vz)^{v2.,vz)} however, will force us to make a self-loop {{vi^,v^}. The 
degree-based graph construction problem for simple undirected labelled graphs thus can 
be announced as follows: 

Degree-based Graph Construction: 

Given a sequence of integers d — {di, d2, ■ ■ ■ , dn}, di > . . . > dn > 1, 

A) Is there a simple graph G{V, E) on n-nodes reahzing d? 

B) If the answer to A) is yes, how can we build such a graph? 

C) Can we build all such graphs? 

D) Let Q{d) be the set of all such graphs. How can we sample at uniform from Q{d)7 

There are two well-known theorems that answer question A) above, namely the 
Erdos-Gallai theorem [12] and the Havel-Hakimi theorem [13, 14], the latter also giving 
a construction algorithm for a graph with degree sequence d and thus answering question 
B) as well (see Section 2). In principle, problem C) can be resolved via the method of 
edge swaps starting from the graph produced by the Havel-Hakimi procedure from B) 
(called an HH-graph from now on) and book-keeping the swaps (which gets rather 
involved). Given two edges {vi,V2) and (^3,^4), they can be swapped into {vi,Vs) 
and (t'2,1'4), or {vi,V4) and (^2,^3) leaving the degree sequence unchanged. Due to 
a theorem by Ryser [5], if Gi and G2 are two simple graphs with identical degree 
sequences, then there is a sequence of edge swaps that transforms one into another [6, 7]. 
Edge swapping is also at the basis of all sampling algorithms attempting to answer D), 
using a Markov Chain Monte Carlo approach, the literature of which is too extensive 
to be reviewed here. The basic idea is to keep swapping edges until the memory of 
the initial condition (HH-graph) is lost and one produces a (pseudo)-random instance. 
This sampling method is approximative and it is not well controlled in general (except 
for some specific sequences). A simple and direct (swap-free) construction method to 
produce a uniformly sampled random graph from Q{d) was presented by Molloy and 
Rccd (M-R) [8] (see Section 5) and subsequently used to generate graphs with given 
degree sequences [9], including those described by power-law degree distributions [10]. 
The problem with the M-R algorithm is that it can become very slow due to rejections 
caused by self- loops and parallel edges (see Section 5). 

Here we present a new approach to degree-based graph construction. First we 
prove our main result that gives the sufficient and necessary conditions for a sequence 
of integers to be graphical such that a given (but otherwise arbitrary) set of connections 
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from a given (but otherwise arbitrary) node are avoided. We then show how to use 
this resuh. to present an algorithm that builds all graphs from Q{d) (question C)). It 
is important to emphasize, that our algorithm does not use edge swaps, it is a direct 
construction method. Lastly, (Section 5) we show how our result improves on the M-R 
method of uniform sampling, allowing to reject some of the samples without actually 
getting to the point where the multi-edge conflicts would actually occur (see Section 
5). We also make a connection with the renowned Tutte's /-factor theorem [3, 11], 
showing that our result provides a greedy algorithm for constructing all /-factors in 
case of Kn \ Sk where is the complete graph on n nodes and Sk is a star graph with 
k leaves centered on some arbitrary node. 

The paper is organized as follows: Section 2 recalls known fundamental theorems for 
graph construction, and via a simple counter-example it shows that HH is not sufficient 
to build all graphs from Q{d); Section 3 presents our main theorem with its proof and 
Section 4 describes the algorithm for building all graphs in Q{d); Section 5 is devoted 
to discussions. 

2. Previous results 

For simplicity of the notation we will identify node Vi by the integer i. There are two 
well-known necessary and sufficient conditions for a sequence of nonnegative integers 
to be graphical: one was given independently by Havel [13] and Hakimi [14] while the 
other is due to Erdos and Gallai [12]. We now announce these results for later reference, 
however, without proof, those can be found in the corresponding references. 

Theorem 1 (Hakimi-Havel, HH) There exists a simple graph with degree sequence 
di > 0, d2 > ... > dn > if and only if there exists one with degree sequence 

d2 — 1, ■ ■ ■ , (irf^+l — 1, C?di+2, ■ ■ ■ ,dn- 

Theorem 2 (Erdos-Gallai, EG) Let di > ^2 > > dn > be integers. Then they 

are the degree sequence of a simple graph if and only if 

(i) di + ... + dn is even 

(ii) for all k — 1, ...,n — 1 we have 

k n 

^di<k{k-\)+ ^ min{A;, di} . (1) 

i=\ i=fe+l 

Note that Theorem 1 provides a greedy algorithm (the HH-algorithm) to generate an 
actual graph with the given degree sequence d while Theorem 2 is an existence result. 
Tripathi and Vijay have recently shown [15] that it is enough to check the inequalities 
(1) for 1 < A; < s, where s is determined by dg > s, dg+i < s -|- 1, that is only as many 
times as many distinct terms are in the degree sequence. 

In the following we will imagine the given degree sequence as a collection of stubs: 
at each vertex i there are di stubs ( "half-edges" ) , anchored at the vertex, but the other 
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ends are free. Connecting two stubs at two distinct nodes will form an edge between 
those nodes. We will call the residual degree the number of current stubs of a node. 

The HH-algorithm for constructing a graph realizing a graphical sequence d 
proceeds as follows: connect all stubs of a node to nodes that have the largest residual 
degrees and repeat until no stubs are left. It is important to emphasize, that one can 
choose any node to connect its stubs, as long as we connect all its stubs to the other 
nodes with the largest residual degrees. Clearly, if we always choose a high degree 
node (from the residual sequence) to connect its stubs, the HH-algorithm will create 
a graph in which high degree nodes tend to be connected to other high degree nodes, 
called assortative property [16]. However, if we always pick a node with low (residual) 
degree to connect, we will likely obtain a graph with dissassortative property [17]. The 
HH-theorem is also a consequence (as a corollary) of our main result, see Section 3. 
Nevertheless, this is still not enough to produce a// graphs realizing a graphical sequence! 
To see that, consider the graphical sequence d = {3, 3, 2, 2, 2, 2, 2, 2}. 




Figure 1. This graph cannot be obtained by the Havel-Hakimi procedure. The integers 
indicate node degrees. 

If the first node to connect is a node with degree 3, then the HH-algorithm connects 
it to the other node with degree 3 (highest degree). If the first node to connect has degree 

2, then the HH-algorithm connects both its stubs to nodes with degree 3. However, the 
graph in Fig. 1 does not have any of the connections just mentioned (a 3 — 3 or 3 — 2 — 3 
connection), and thus it cannot be constructed with the HH-algorithm. In the next 
section we introduce a theorem that allows us to construct all labelled graphs with a 
given degree sequence. 

The above results are naturally placed in the larger context of Tutte's famous /- 
factor theorem [3]. Given an integer function / : y — > N U {0}, the f -factor of a given 
simple graph G{V, E) is a subgraph H of G such that dniv) = f{v) for all v ^V. Here 
dniv) is the degree of v within H. Tutte gave sufficient and necessary conditions for 
the existence of an /-factor for G [3], and later connected this to the problem of finding 
perfects matchings in bipartite graphs [11]. It is not hard to sec, that taking G = Kn, 
that is the complete graph on n-nodes, the /-factor problem is exactly question A) of 
the degree-based construction problem with d — {f{vi), . . . , f{vn)}- In this sense, the 
HH-algorithm is a greedy method for constructing an /-factor on K^. 

3. Graphical sequences with constraints from a single node 

Before we can announce and prove our main result, we need to introduce a number of 
definitions and observations. 
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Definition 1 Let A{i) be an increasingly ordered set of di distinct nodes associated with 
node i: A{i) — {a^ | e V, i, \fk, 1 < k < di}. 

Usually, this set will represent the set of nodes adjacent to node i in some graph G, 
therefore we will refer to A{i) as an adjacency set of i. 

Definition 2 If for two adjacency sets A{i) — {. . . , a^, . . .} and B{i) — {. . . , 6^, . . .} 

we have bk < Ofc for all 1 < k < di, we say that B{i) < A{i). 

In this case we also say that B{i) is "to the left" of A{i). 

Definition 3 Let di > d2 > ■ ■ ■ > dn > 1 be a graphical sequence, and let A{i) be an 
adjacency set of node i. The degree sequence reduced by A{i) is defined as: 

{4-1 if ke A{i) 
4 if ke[l,n]\{A{^U{t}) (2) 

if k^i. 

In other words, if A{i) is the set of adjacent nodes to i in the graph G, then the reduced 
degree sequence d'\^(^-^ is obtained after removing node i with all its edges from G. 

Lemma 3 Let {di, . . . , dj, . . . , d^, ■ ■ ■ , dn} be a non-increasing graphical sequence and 
assume dj > 4- Then the sequence {di, . . . , — 1, . . . , 4 + 1, • • • , 4} is also graphical 
(not necessarily ordered). 

Proof. Since dj > 4; there exists a node m connected to node j, but not connected 
to node k. Let us cut edge {m,j) and remove the disconnected stub of j. If 
we add one more stub to k, and connect this new stub to the disconnected stub 
of m, then we can see that the new graph is also simple with degree sequence 
{4, 4, dj-1, . . . , 4 + 1, . . . , 4}. □ 

Lemma 4 Let d — {4, 4, •••,4}? be a non-increasing graphical sequence, and let 
A{i), B{i) be two adjacency sets for some node i E V, such that B{i) < A(i). If the 
degree sequence reduced by A{i) (that is d'\^^.^) is graphical, then the degree sequence 
reduced by B{i) (that is d'l^f^^-^) is also graphical. 

Proof. Let A{i) = {. . . , a^, . . .} and B{i) = {. . . ,bk, ■ ■ ■}, k = 1, . . . ,di. Consider the 
adjacency set B^{i) = 02, 03, . . . , a^.} (we replaced node ai by node bi < ai). If 
bi = ai then there is nothing to do, wc move on (see below). If bi < ai then conditions 
in Lemma 3 are fulfilled. Namely, bi < ai implies 4i > 4i > 4i — 1 and we know that 
the sequence <i'|^^j^ = {. . . , 4i > • • • > 4i ~ 1? • • • > 42 ~ I7 • • •} is graphical by assumption. 
Thus, according to Lemma 3, the sequence {. . . , 4i — 1, • • • , 4i, • • • , 42 ~ 1; • • •} is also 
graphical, that is the one reduced by the set B^{i). Next, we will proceed by induction. 
Consider the adjacency set B'^{i) — {bi, . . . ,bm, Qm+i, 0"m+2, • • • , a^i} and assume that 
the degree sequence reduced by it (from d) is graphical. Now, consider the adjacency set 

= {bi,..., bm+i, am+2, am+3: • • • , «<ii} (replaced a^+i by b 
Lemma 3 can be applied again since fem+i < Om+i implies 4„+i > 4™+i > 4„+i — 1, 
showing that the sequence reduced by B"^~^^{i) is also graphical. The last substitution 
(m + 1 = di) finishes the proof. □ 
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Definition 4 Let d = {di, d2, ■ ■ ■ , dn} be a decreasing graphical sequence and consider 
an arbitrary node i G V, and an arbitrarily fi^xed integer m with < m < n—l—di. Let us 
fix a set of nodes X{i) = {ji, . . . , j^} C V \ {i} and consider the set L{i) = {/i, . . . , /d-} 
containing the di lowest index nodes not in X{i) and different from i. We call L{i) the 
leftmost adjacency set of i restricted by X{i). Accordingly, we call the set of nodes X{i) 
the set of forbidden connections for i. 

Lemma 5 If d = {di, d2, ■ ■ ■ , dn} is a decreasing graphical sequence, and Y{i) — 
{yi, ■ ■ - Vdi} is an adjacency set disjoint from X{i) U {i}, then L{i) < Y{i). 

Proof. This is immediate, since by Definition 4, Ij < yj, for all j — {1, . . . , di}. □ 
We are now ready for the main theorem: 

Theorem 6 (Steir-constrained graphical sequences) Let di > d2 > ■ ■ - dn > 1 be 

a sequence of integers. For an arbitrary node i & V define a set X{i) — {ji, . . . , jm} C 
y \ {z} with m < n — 1 — di, and consider L{i), the leftmost adjacency set of i restricted 

by X{i). Then the degree sequence d = {di, . . . ,dn} can be realized by a simple graph 
G{V, E) in which {i,j) ^ E, for all j e X{i), if and only if the degree sequence reduced 
by L{i) is graphical. 

Proof. "<^" is straightforward: add node i to the reduced set of nodes, then connect it 
with edges to the nodes of L{i). Thus we obtained a graphical realization of d in which 
there are no connections between i and any node in X. 

In this case d is graphical with no links between i and X{i), and we have to show 
that the sequence obtained from d by reduction via L{i) is also graphical. However, d 
graphical means that there is an adjacency set A{i) (with A{i) fl X{i) = 0) containing 
all the nodes that i is connected to in G. Thus, according to Lemma 5, we must have 
L{i) < A{i). Then, by Lemma 4, the sequence reduced by L{i) is graphical. □ 
Note, that, the forbidden set of connections form a star graph S\x{i)\ centered on 
node i. Also note that considering the empty set as the set of forbidden nodes, X{i) — 0, 
we obtain the Havel-Hakimi Theorem 1 as corollary. 

Informally, Theorem 6 can be announced as follows: 

Let d = {di,d2, ...,(i„}, be a decreasing graphical sequence and let i be a fixed, but 
arbitrary vertex. Assume we are given a set of forbidden connections in V incident on i. 
Then there exists a realization of the degree sequence avoiding all forbidden connections 
if and only if there also exists a realization where i is connected with vertices of highest 
degree among the non-forbidden ones. 

Since the forbidden connections emanating from a node i form a star graph Sk, 
k — \X{i)\, Theorem 6 provides sufficient and necessary conditions for the existence 
of an /-factor for G — Kn\Sk. More importantly, it gives a greedy algorithm for finding 
such an /-factor. 
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4. Building all graphs from Q{d) 

As we show next, Theorem 6 provides us with a procedure that allows for the 
construction of all graphs realizing the same degree sequence. 

Consider a graphical degree sequence d on n nodes. Certainly, we can produce all 
graphs realizing this sequence by connecting all the stubs of a chosen node first, before 
moving on to another node with stubs to connect (that is we finish with a node, before 
moving on). In this vein, now choose a node i and connect one of its stubs to some other 
node ji. Is the remaining degree sequence d' = {di, . . . ,di — 1, . . . ,dj^ — 1, . . . ,dn} still 
graphical such that nodes i and ji avoid another connection in subsequent connections 
of the other stubs? Certainly, as a necessary condition, d' has to be graphical as a 
sequence, since all subgraphs of a simple graph are simple, and thus if G is a simple 
graph realizing d with G E, then after removing this edge, the remaining graph is 

still simple. However, it is not sufficient that after making some connections from a node, 
the residual sequence to be graphical. One might still be forced to make multiple edges, 
as illustrated by the following example. Consider the graphical sequence {2,2,1,1} (the 
path P4, •-•-•-•) as the degrees of the set of nodes V = {u, v, x, y} {du = d^ = 2, 
dx = dy = 1). Connect nodes u and v. We certainly have not broken the graphical 
character yet, since wc could still finish the path by connecting next u to x (or to 
y) and v to y (or to x). The remaining sequence {1,1,1,1} as a sequence of integers 
is graphical (two edges). Next, connect node x to node y. The remaining (residual) 
sequence is {1,1} (emanating from node u and v, respectively), graphical on its own, 
however, we can no longer connect nodes u and v, because the very first connection is 
already there. Thus, after making one, or more connections from a node i, how can we 
check that the next connection from i will not break the graphical character? 

Theorem 6 answers this question if we think of the connections already made from 
node i as forbidden connections. That is, after the first connection of i to ji take d' as 
d in Theorem 6 and X{i) = {ji}. Then, to test whether the sequence reduced by the 
corresponding L{i) is graphical we can employ for example the Erdos-Gallai Theorem 
2, checking all the inequalities, or the Havel- Hakimi Theorem 1. If the test fails on 
the reduced sequence, one must disconnect i from ji and reconnect it somewhere else. 
The graphical character of the original sequence guarantees that there is always a ji 
where the test will not fail. If, however, the remaining degree sequence is graphical with 
the constraint imposed by X(i), we connect another stub of i to some other node j2 
(different from ji), adding an element to the forbidden set of connections X{i). To check 
whether after the second connection the remaining sequence is still graphical with the 
constraint imposed by the new set X{i) = {ii, J2}, we proceed in exactly the same way, 
using Theorem 6, repeating the procedure until all the stubs of node i arc connected 
away into edges. After this we can move on to some other node i (arbitrary) from the 
remaining set of nodes and repeat the procedure. Note that this procedure is not a 
real procedure in the sense that it does not prescribe which stubs to connect. It only 
tells us whether the connection we just made (by whatever process) has broken the 
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graphical character. Since every element from G{d) can be realized by some sequence 
of connections, it is clear that if we specify a systematic way of going through all the 
possible connections while employing Theorem 6, we will realize all elements of Q{d). 
However, taking all possible connections would be very inefficient. Next we present a 
version of a more economical algorithm that constructs every labelled graph with degree 
sequence d, and only once. For simplicity of the notation we will call the test for the 
graphical character via Theorem 6, the "CG test" (constrained graphicality test). The 
algorithm also exploits Lemma 4, which guarantees preservation of graphicality for all 
adjacency sets to the left of a graphical one, thus avoiding costher checks with EG or 
HH theorems for those adjacency sets. Clearly, a labeled graph can be characterized 
by the sequence of its adjacency sets G = {A{1) , . . . , A{n)} . This algorithm creates 
all the possible adjacency sets for node 1, then for each one of those repeats the same 
procedure on the reduced sequence by that adjacency set (in sense of Definition 3) of at 
most n — 1 nodes. 

Algorithm 1 (All graphs) Given a graphical sequence di > d2 > ■ ■ ■ dn > 1, 

I. Create the rightmost adjacency set Aji{l) for node 1: Connect node 1 to n (this 
never breaks graphicality) . Let k = n — 1. 

1.1 Connect another stub of 1 to k. Run the CG test. 

1. 2 If it fails, make k = k — 1. Repeat 1. 1 

1.3 If passes, keep (save) the connection, make k = k — 1, and if i has stubs left, 
repeat from 1. 1 . 

II. Create the set A{d) of all adjacency sets of node 1 that are colexicographically 
smaller than ^_r(1) and preserve graphicality: 

A{d) = |a(1) = {ai, . . . , OdJ, a, e y A{1) <cl Ar{1), d'\A{i) - graphical^ . 

III. For every A(l) G A{d) create all graphs from the corresponding Q{d'\A{i)) using 
this Algorithm, where d'\A{i) is the sequence reduced by A(l). 

For simplicity of the notation, we will drop the (1) from A(l), tacitly assuming that it 
refers to the leftmost node 1. Observe that the ordering relation "<" in Definition 2 is a 
partial order, while the colexicographic order "<cl" is a total order over all adjacency 
sets, however, "<" implies ''<cl^ ■ It is not hard to see, that Ar is colexicographically 
the largest ( "rightmost" ) sequence which still preserves graphicality. When constructing 
A{d), checking graphicality with the EG or HH theorems is only needed for those 
adjacency sets, which are incomparable by the "<" relationship to any of the current 
elements of ^(d), while for the rest graphicality is guaranteed by Lemma 4. 

5. Discussion and Outlook 



Algorithm 1 proceeds by attempting to connect all stubs of the largest degree node as 
much to the right as possible. Depending on the degree sequence, it might happen that 
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the CG test fails many times at step I., until it finds Aji. However, in that case, Aji 
is located more towards the higher degree nodes (towards left) and thus the number 
of adjacency sets that preserve graphicality, namely |^| is smaller and accordingly, the 
algorithm has fewer cases to run through in subsequent steps. The more heterogeneous 
is a degree sequence, the more likely this will happen. Of course, it only makes sense 
to produce all graphs from Q{d) for small graphs (chemistry), or graphical sequences 
that do not admit too many solutions. An interesting question would then be finding 
the conditions on the sequence d that would guarantee a given upper bound C on the 
size of Q{d), \G{d)\ < C. A possible starting point in this direction could be Koren's 
[18] characterization of sequences uniquely realizable by a simple graph. Sequences that 
admit only a small number of realizations (by labeled simple graphs) would likely be 
"close" in some sense to these special sequences. 

Algorithm 1 also provides a way to computationally enumerate all the labelled 
graphs |^(d)| realizing a degree sequence d (problem B) of Section 1). Naturally, the 
following recursion holds: \Q{d) \ = Yl:A&A{d) l^('^'U)l • Our graph construction process 
can be thought of as happening along the branches of a tree T{d) of depth at most 
n — 1: the internal nodes of this tree on the kth level are all the allowed adjacency 
sets (from the corresponding A set) of the node with the largest residual degree (the 
leftmost node). The reason this tree is of depth at most n — 1 is because some other 
nodes (other than the one with the largest residual degree) in the process might loose 
all their stubs. A directed path towards a leaf of this tree corresponds to a graphical 
realization of d, because wc end up specifying all the adjacency sets along this path. 
Based on this, during the realization of the graph, if we choose uniformly at random at 
every level of the tree within the children of a node, from the corresponding set A, the 
probability of a final realization G in this process will be given by the product: 

P{G) = Y[\A{d'U.,)\-' (3) 

fe=0 

where A^^'> is the randomly chosen adjacency set of the node with the largest residual 
degree, on the k-th level of T{d). By convention A^^^ = 0, and <i'|^(o) = d. It is not 
hard to convince ourselves that the distribution in (3) is not uniform, and thus, this 
algorithm cannot be used in this form to produce uniform samples from Q{d). However, 
Theorem 6 can be used to improve on a well-known, direct uniform sampling process, 
the MoUoy-Reed (M-R) algorithm [8]. In this process, one chooses among all the stubs 
uniformly at random, irrespectively to what node they belong to. This is repeated until 
either a self-loop, or a double edge is created, or a simple graph is finished. When there 
is a self-loop or double edge, the process is stopped, and the algorithm starts from the 
very beginning. The CG test can be used along the way to test for graphicahty after 
every connection just made. In particular, assume that we just connected node i with 
node j. Wc then run the CG test centered on node i (adding the {i-,]) connection to the 
forbidden set from i). If passes, we run the CG test on node j as well. If it fails either on 
i or j, we can stop the process, before actually running into a conflict later. Running into 
conflict usually happens towards the end, and this test can save us from possibly many 



Degree-based graph construction 



11 



unnecessary calls to the random number generator, speeding up the sampling algorithm. 
It is important to note, that if the CG test passes on both i and j, we actually do not 
know whether graphicality is broken or still preserved at that stage! If the CG test fails, 
however, we know that graphicality was broken. The reason is because Theorem 6 gives 
us the sufficient and necessary conditions for the sequence to be graphical such that no 
multiple edges will be made with the already existing connections emanating from the 
same node. It gives no such information for connections already made elsewhere! Of 
course, for degree sequences d for which there is only a small number of labelled graphs 
reahzing it (compared to the total number Yli^^i^- graphs that it produces), the M-R 
algorithm would search for needles in a haystack, and other (MCMC) methods will be 
necessary. 

In summary, we have given necessary and sufficient conditions for a sequence of 
integers di > . . .dn > 1 to be graphical (realizable by simple, undirected graphs) 
avoiding multiple edges with an arbitrary star graph (the forbidden graph) Sk{j), 
< k < n — dj, centered on a node j. In a more general context, our result gives for 
the first time a greedy construction for Tutte's /-factor subgraphs within Kn \ Sk{j). It 
would be desirable if such a greedy construction existed for an arbitrary forbidden graph 
F, not just for star-graphs, however, at this point this still seems to be a rather difficult 
problem. Such an algorithm (if greedy) would further speed up the M-R sampling, 
because it would induce early rejections, as soon as they are made. Our main theorem 
also led to a direct and systematic construction algorithm that builds all graphs realizing 
a given degree sequence d. And finally, we mention that these studies can be extended 
to simple directed graphs as well (there are two degree sequences in this case), however, 
the computations are considerably more involved, and they will be presented separately. 
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